Sub-band weighted projection measure for robust sub-band speech recognition
نویسندگان
چکیده
In recent years, sub-band speech recognition has been found useful in robust speech recognition, especially for speech signals contaminated by band-limited noise. In sub-band speech recognition, full band speech is divided into several frequency sub-bands and then sub-band feature vectors or their generated likelihoods by corresponding sub-band recognizers are combined to give the result of recognition task. In this paper, we concatenate sub-band feature vectors, where we extract phase autocorrelation (PAC) MFCC, as noise robust features, from each sub-band. Furthermore, we extend a model adaptation method, named sub-band weighted projection measure (SWPM), to adapt HMM Gaussian mean vectors to concatenated sub-band feature vectors in noisy conditions. The experimental results indicate that the proposed method significantly improves the sub-band speech recognition system performance in presence of additive noise.
منابع مشابه
IMPROVED HMM ENTROPY FOR ROBUST SUB−BAND SPEECH RECOGNITION (ThuPmOR1)
In recent years, sub−band speech recognition has been found useful in robust speech recognition, especially for speech signals contaminated by band−limited noise. In sub−band speech recognition, full band speech is divided into several frequency sub−bands and then sub−band feature vectors or their generated likelihoods by corresponding sub−band recognizers are combined to give the result of rec...
متن کاملMulti-Channel Sub-Band Speech Recognition
Two distinct fields of research into robust speech recognition are the use of microphone arrays for signal enhancement and the use of independent frequency sub-band models for robust recognition. In this article, we propose and investigate the integration of these two techniques on two different levels. First, a broad-band beamforming microphone array allows for natural integration with sub-ban...
متن کاملMaximum likelihood sub-band weighting for robust speech recognition
Sub-band speech recognition approaches have been proposed for robust speech recognition, where full-band power spectra are divided into several sub-bands and then likelihoods or cepstral vectors of the sub-bands are merged depending on their reliability. In conventional sub-band approaches, correlations across the sub-bands are not modeled and the merging weights can only be set experientially ...
متن کاملMaximum likelihood sub-band adaptation for robust speech recognition
Noise-robust speech recognition has become an important area of research in recent years. In current speech recognition systems, the Mel-frequency cepstrum coefficients (MFCCs) are used as recognition features. When the speech signal is corrupted by narrow-band noise, the entire MFCC feature vector gets corrupted and it is not possible to exploit the frequency-selective property of the noise si...
متن کاملMulti-resolution cepstral features for phoneme recognition across speech sub-bands
Multi-resolution sub-band cepstral features strive to exploit discriminative cues in localised regions of the spectral domain by supplementing the full bandwith cepstral features with subband cepstral features derived from several levels of sub-band decomposition. Mult-iresolution feature vectors, formed by concatenation of the subband cepstral features into an extended feature vector, are show...
متن کامل